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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

AGCTCGCTGA GACTTCCTGG ACCCCGCACC AGGCTGTGGG GTTTCTCAGA TAACTGGGCC 60 

~*C>. X >- ^TGCGCTCA GGAGGCCTTC ACCCTCTGCT CTGGGTAAAG TTCATTGGAA CAGAAAGAAA~ 120 

;TGGATTTATC TGCTCTTCGC GTTGAAGAAG TACAAAATGT CATTAATGCT ATGCAGAAaT 180 

10 TCTTAGAGTG TCCCATCTGT CTGGAGTTGA TCAAGGAACC TGTCTCCACA AAGTGTGACC 240 

U ACATATTTTG CAAATTTTGC ATGCTGAAAC TTCTCAACCA GAAGAAAGGG CCTTCACAGT 3 00 

VJ GTCCTTTATG TAAGAATGAT ATAACCAAAA GGAGCCTACA AGAAAGTACG AGATTTAGTC 3 60 

W AACTTGTTGA AGAGCTATTG AAAATCATTT GTGCTTTTCA GCTTGACACA GGTTTGGAGT 4 20 

m ATGCAAACAG CTATAATTTT GCAAAAAAGG AAAATAACTC TCCTGAACAT CTAAAAGATG 4 80 

]f AAGTTTCTAT CATCCAAAGT ATGGGCTACA GAAACCGTGC CAAAAGACTT CTACAGAGTG 5 40 

gi AACCCGAAAA TCCTTCCTTG CAGGAAACCA GTCTCAGTGT CCAACTCTCT AACCTTGGAA 600 

yt CTGTGAGAAC TCTGAGGACA AAGCAGCGGA TACAACCTCA AAAGACGTCT GTCTACATTG 6 60 

IJ! AATTGGGATC TGATTCTTCT GAAGATACCG TTAATAAGGC AACTTATTGC AGTGTGGGAG 720 

, ATCAAGAATT GTTACAAATC ACCCCTCAAG GAACCAGGGA TGAAATCAGT TTGGATTCTG 7 80 

IF CAAAAAAGGC TGCTTGTGAA TTTTCTGAGA CGGATGTAAC AAATACTGAA CATCATCAAC 840 

3 CCAGTAATAA TGATTTGAAC ACCACTGAGA AGCGTGCAGC TGAGAGGCAT CCAGAAAAGT 900 

ay ATCAGGGTAG TTCTGTTTCA AACTTGCATG TGGAGCCATG TGGCACAAAT ACTCATGCCA 960 

GCTCATTACA GCATGAGAAC AGCAGTTTAT TACTCACTAA AGACAGAATG AAT GTAG AAA 1020 

AGGCTGAATT CTGTAATAAA AGCAAACAGC CTGGCTTAGC AAGGAGCCAA CATAACAGAT 108 0 

GGGCTGGAAG TAAGGAAACA TGTAATGATA GGCGGACTCC CAGCACAGAA AAAAAGGTAG 114 0 

ATCTGAATGC TGATCCCCTG TGTGAGAGAA AAGAATGGAA TAAGCAGAAA CTGCCATGCT 1200 

CAGAGAATCC TAG AG AT ACT GAAGATGTTC CTTGGATAAC ACTAAATAGC AGCATTCAGA 12 60 

AAGTTAATGA GTGGTTTTCC AGAAGTGATG AACTGTTAGG TTCTGATGAC TCACATGATG 132 0 

GGGAGTCTGA ATCAAATGCC AAAGTAGCTG ATGTATTGGA CGTTCTAAAT GAGGTAGATG 1380 

AATATTCTGG TTCTTCAGAG AAAATAGACT TACTGGCCAG TGATCCTCAT GAGGCTTTAA 14 4 0 

TATGTAAAAG TGAAAGAGTT CACTCCAAAT CAGTAGAGAG TAATATTGAA GACAAAATAT 1500 

TTGGGAAAAC CTATCGGAAG AAGGCAAGCC TCCCCAACTT AAGCCATGTA ACTGAAAATC 1560 

TAATTATAGG AGCATTTGTT ACTGAGCCAC AGATAATACA AGAGCGTCCC CTCACAAATA 162 0 

AATTAAAGCG TAAAAGGAGA CCTACATCAG GCCTTCATCC TGAGGATTTT ATCAAGAAAG 168 0 

■55 CAGATTTGGC AGTTCAAAAG ACTCCTGAAA TGATAAATCA GGGAACTAAC CAAACGGAGC 174 0 

AGAATGGTCA AGTGATGAAT ATTACTAATA GTGGTCATGA GAATAAAACA AAAGGTGATT 1800 

CTATTCAGAA TGAGAAAAAT CCTAACCCAA TAGAATCACT CGAAAAAGAA TCTGCTTTCA 1860 

AAACGAAAGC TGAACCTATA AGCAGCAGTA TAAGCAATAT GGAACTCGAA TTAAATATCC 192 0 

ACAATTCAAA AGCACCTAAA AAGAATAGGC TGAGGAGGAA GTCTTCTACC AGGCATATTC 198 0 

ATGCGCTTGA AC TAG TAG TC AGTAGAAATC TAAGCCCACC TAATTGTACT GAATTGCAAA 204 0 

TTGATAGTTG TTCTAGCAGT GAAGAGATAA AGAAAAAAAA GTACAACCAA ATGCCAGTCA 210 0 

GGCACAGCAG AAACCTACAA CTCATGGAAG GTAAAGAACC TGCAACTGGA GCCAAGAAGA 2160 

GTAACAAGCC AAATGAACAG ACAAGTAAAA G AC AT G AC AG TGATACTTTC CCAGAGCTGA 2220 

AGTTAACAAA TGCACCTGGT TCTTTTACTA AGTGTTCAAA TACCAGTGAA CTTAAAGAAT 2280 

« TTGTCAATCC TAGCCTTCCA AGAGAAGAAA AAGAAGAGAA ACTAGAAACA GTTAAAGTGT 234 0 
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CTAATAATGC TGAAGACCCC AAAGATCTCA TGTTAAGTGG AGAAAGGGTT TTGCAAACTG 24 00 
AAAGATCTGT AGAGAGTAGC AGTATTTCAC TGGTACCTGG TACTGATTAT GGCACTCAGG 24 60 
AAAGTATCTC GTTACTGGAA GTTAGCACTC TAGGGAAGGC AAAAACAGAA CCAAATAAAT 2 520 
GTGTGAGTCA GTGTGCAGCA TTTGAAAACC CCAAGGGACT AATTCATGGT TGTTCCAAAG 2580 
ATAATAGAAA TGACACAGAA GGCTTTAAGT ATCCATTGGG ACATGAAGTT AACCACAGTC 2 64 0 
GGGAAACAAG CATAGAAATG GAAGAAAGTG AACTTGATGC TCAGTATTTG CAGAATACAT 2 700 
TCAAGGTTTC AAAGCGCCAG TCATTTGCTC TGTTTTCAAA TCCAGGAAAT GCAGAAGAGG 2760 
AATGTGCAAC ATTCTCTGCC CACTCTGGGT CCTTAAAGAA ACAAAGTCCA AAAGTCACTT 2 820 
TTGAATGTGA ACAAAAGGAA GAAAATCAAG GAAAGAATGA GTCTAATATC AAGCCTGTAC 2880 

AGACAGTTAA TATCACTGCA GGCTTTCCTG TGGTTGGTCA GAAAGATAAG CCAGTTGATA 2 94 0 

ATGCCAAATG TAGTATCAAA GGAGGCTCTA GGTTTTGTCT AT CAT C T C AG TTCAGAGGCA 3000 

ACGAAACTGG ACTCATTACT CCAAATAAAC ATGGACTTTT ACAAAACCCA TATCGTATAC 3060 

;;;; caccactttt tcccatcaag tcatttgtta aaactaaatg taagaaaaat ctgctagagg 3120 

*T AAAACTTTGA GGAACATTCA ATGTCACCTG AAAGAGAAAT GGGAAATGAG AACATTCCAA 318 0 

J«> GTACAGTGAG CACAATTAGC CGTAATAACA TTAGAGAAAA TGTTTTTAAA GGAGCCAGCT 32 4 0 

CAAGCAATAT TAATGAAGTA GGTTCCAGTA CTAATGAAGT GGGCTCCAGT ATTAATGAAA 3300 

|U TAGGTTCCAG TGATGAAAAC ATTCAAGCAG AACTAGGTAG AAACAGAGGG CCAAAATTGA 33 60 

«J ATGCTATGCT TAGATTAGGG GTTTTGCAAC CTGAGGTCTA TAAACAAAGT CTTCCTGGAA 34 20 

IA| GTAATTGTAA GCATCCTGAA ATAAAAAAGC AAGAATATGA AGAAGTAGTT CAGACTGTTA 34 80 

HP ATACAGATTT CTCTCCATAT CTGATTTCAG ATAACTTAGA ACAGCCTATG GGAAGTAGTC 35 4 0 

a ATGCATCTCA GGTTTGTTCT GAGACACCTG ATGACCTGTT AGATGATGGT GAAATAAAGG 3600 

j>i AAGATACTAG TTTTGCTGAA AATGACATTA AGGAAAGTTC TGCTGTTTTT AGCAAAAGCG 3660 

□ TCCAGAGAGG AGAGCTTAGC AGGAGTCCTA GCCCTTTCAC CCATACACAT TTGGCTCAGG 3720 

Hi GTTACCGAAG AGGGGCCAAG AAATTAGAGT CCTCAGAAGA GAACTTATCT AGTGAGGATG 37 8 0 

Ft? AAGAGCTTCC CTGCTTCCAA CACTTGTTAT TTGGTAAAGT AAACAATATA CCTTCTCAGT 38 4 0 

p CTACTAGGCA TAGCACCGTT GCTACCGAGT GTCTGTCTAA GAACACAGAG GAGAATTTAT 3 900 

u TATCATTGAA GAATAGCTTA AATGACTGCA GTAACCAGGT AATATTGGCA AAGGCATCTC 3 960 

AGGAACATCA CCTTAGTGAG GAAACAAAAT GTTCTGCTAG CTTGTTTTCT TCACAGTGCA 4 020 

GTGAATTGGA AGACTTGACT GCAAATACAA ACACCCAGGA TCCTTTCTTG ATTGGTTCTT 4 080 

30 CCAAACAAAT GAGGCATCAG TCTGAAAGCC AGGGAGTTGG TCTGAGTGAC AAGGAATTGG 414 0 

TTTCAGATGA TGAAGAAAGA GGAACGGGCT TGGAAGAAAA TAATCAAGAA GAGCAAAGCA 4 200 

TGGATTCAAA CTTAGGTGAA GCAGCATCTG GGTGTGAGAG TGAAACAAGC GTCTCTGAAG 4 2 60 

ACTGCTCAGG GCTATCCTCT CAGAGTGACA TTTTAACCAC TCAGCAGAGG GATACCATGC 4 320 

AACATAACCT GATAAAGCTC CAGCAGGAAA TGGCTGAACT AGAAGCTGTG TTAGAACAGC 4 38 0 

35 ATGGGAGCCA GCCTTCTAAC AGCTACCCTT CC*ATCATAAG TGACTCCTCT GCCCTTGAGG 4 440 

ACCTGCGAAA TCCAGAACAA AGCACATCAG AAAAAGCAGT ATTAACTTCA CAGAAAAGTA 4 500 

GTGAATACCC TATAAGCCAG AATCCAGAAG GCCTTTCTGC TGACAAGTTT GAGGTGTCTG 4 5 60 

CAGATAGTTC TACCAGTAAA AATAAAGAAC CAGGAGTGGA AAGGTCATCC CCTTCTAAAT 4 620 

GCCCATCATT AG AT GAT AGG TGGTACATGC ACAGTTGCTC TGGGAGTCTT CAGAATAGAA 4 680 

ACTACCCATC TCAAGAGGAG CTCATTAAGG TTGTTGATGT GGAGGAGCAA CAGCTGGAAG 4 740 

AGTCTGGGCC ACACGATTTG ACGGAAACAT CTTACTTGCC AAGGCAAGAT CTAGAGGGAA 4 800 

CCCCTTACCT GGAATCTGGA ATCAGCCTCT TCTCTGATGA CCCTGAATCT GATCCTTCTG 4 8 60 

AAGACAGAGC CCCAGAGTCA GCTCGTGTTG GCAACATACC ATCTTCAACC TCTGCATTGA 4 920 

AAGTTCCCCA ATTGAAAGTT GCAGAATCTG CCCAGGGTCC AGCTGCTGCT CATACTACTG 4 980 

45 ATACTGCTGG GTATAATGCA ATGGAAGAAA GTGTGAGCAG GGAGAAGCCA GAATTGACAG 504 0 

CTTCAACAGA AAGGGTCAAC AAAAGAATGT CCATGGTGGT GTCTGGCCTG ACCCCAGAAG 5100 

AATTTATGCT CGTGTACAAG TTTGCCAGAA AACACCACAT CACTTTAACT AATCTAATTA 5160 
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CTGAAGAGAC TACTCATGTT GTTATGAAAA CAGATGCTGA GTTTGTGTGT GAACGGACAC 5 22 0 

TGAAATATTT TCTAGGAATT GCGGGAGGAA AATGGGTAGT TAGCTATTTC TGGGTGACCC 5 28 0 

AGTCTATTAA AGAAAGAAAA ATGCTGAATG AGCATGATTT TGAAGTCAGA GGAGATGTGG 5 34 0 

TCAATGGAAG AAACCACCAA GGTCCAAAGC GAGCAAGAGA ATCCCAGGAC AGAAAGATCT 5 4 00 

TCAGGGGGCT AGAAATCTGT TGCTATGGGC CCTTCACCAA CATGCCCACA GATCAACTGG 5 4 60 

AATGGATGGT ACAGCTGTGT GGTGCTTCTG TGGTGAAGGA GCTTTCATCA TTCACCCTTG 5 52 0 

GCACAGGTGT CCACCCAATT GTGGTTGTGC AGCCAGATGC CTGGACAGAG GACAATGGCT 5 58 0 

TCCATGCAAT TGGGCAGATG TGTGAGGCAC CTGTGGTGAC CCGAGAGTGG GTGTTGGACA 5 64 0 

GTGTAGCACT CTACCAGTGC CAGGAGCTGG ACACCTACCT GATACCCCAG ATCCCCCACA 5 700 

GCCACTACTG A ,--. in 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1863 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met 


Asp 


Leu 


Ser 


Ala 


Leu 


Arg 


Val 


Glu 


Glu 


Val 


Gin 


Asn 


Val 


He 


Asn 


1 








5 










10 










15 




Ala 


Met 
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Lys 


He 


Leu 


Glu 


Cys 


Pro 


He 


Cys 


Leu 


Glu 


Leu 


He 


Lys 








20 










25 










30 




Glu 


Pro 


Val 


Ser 


Thr 


Lys 


Cys 


Asp 


His 


He 


Phe 


Cys 


Lys 


Phe 


Cys 


Met 






35 










40 










4 5 






Leu 


Lys 


Leu 


Leu 


Asn 


Gin 


,Lys 


Lys 


Gly 


Pro 


Ser 


Gin 


Cys 


Pro 


Leu 


Cys 




50 










55 










60 






Lys 


Asn 


Asp 


He 


Thr 


Lys 


Arg 


Ser 


Leu 


Gin 


Glu 


Ser 


Thr 


Arg 


Phe 


Ser 


65 










70 










75 








80 


Gin 


Leu 


Val 


Glu 


Glu 


Leu 


Leu 


Lys 


He 


lie 


Cys 


Ala 


Phe 


Gin 


Leu 


Asp 










85 








• 


90 










95 


Thr 


Gly 


Leu 


Glu 


Tyr 


Ala 


Asn 


Ser 


Tyr 


Asn 


Phe 


Ala 


Lys 


Lys 


Glu 


Asn 








100 










105 








110 






Asn 


Ser 


Pro 


Glu 


His 


Leu 


Lys 


Asp 


Glu 


Val 


Ser 


He 


He 


Gin 


Ser 


Met 






115 










120 










125 








Gly 


Tyr 


Arg 


Asn 


Arg 


Ala 


Lys 


Arg 


Leu 


Leu 


Gin 


Ser 


Glu 


Pro 


Glu 


Asn 




130 










135 










140 










Pro 


Ser 


Leu 


Gin 


Glu 


Thr 


Ser 


Leu 


Ser 


Val 


Gin 


Leu 


Ser 


Asn 


Leu 


Gly 


145 










150 










155 










160 


Thr 


Val 


Arg 


Thr 


Leu 


Arg 


Thr 


Lys 


Gin 


Arg 


He 


Gin 


Pro 


Gin 


Lys 


Thr 










165 










170 










175 




Ser 


Val 


Tyr 


He 


Glu 


Leu 


Gly Ser 


Asp 


Ser 


Ser 


Glu 


Asp 


Thr 


Val 


Asn 



180 185 190 
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Lys Ala Thr Tyr 
195 

Pro Gin Gly Thr 
210 

Ala Cys Glu Phe 
225 

Pro Ser Asn Asn 

His Pro Glu Lys 
260 

Pro Cys Gly Thr 
275 

Ser Leu Leu Leu 
290 

Cys Asn Lys Ser 
305 

Trp Ala Gly Ser 

Glu Lys Lys Val 
340 

Trp Asn Lys Gin 
355 

Asp Val Pro Trp 
370 

Trp Phe Ser Arg 
385 

Gly Glu Ser Glu 

Asn Glu Val Asp 
420 

Ala Ser Asp Pro 
435 

Ser Lys Ser Val 
450 

Tyr Arg Lys Lys 
465 

Leu lie He Gly 

Pro Leu Thr Asn 
500 

His Pro Glu Asp 
515 

Pro Glu Met He 
530 

Val Met Asn He 
545 

Ser He Gin Asn 



Cys Ser Val Gly 
200 

Arg Asp Glu Tie 
215 

Ser Glu Thr Asp 
230 

Asp Leu Asn Thr 
245 

Tyr Gin Gly Ser 

Asn Thr His Ala 
280 

Thr Lys Asp Arg 
295 

Lys Gin Pro Gly 
310 

Lys Glu Thr Cys 
325 

Asp Leu Asn Ala 

Lys Leu Pro Cys 
360 

He Thr Leu Asn 
375 

Ser Asp Glu Leu 
390 

Ser Asn Ala Lys 
405 

Glu Tyr Ser Gly 

His Glu Ala Leu 
440 

Glu Ser Asn He 
455 

Ala Ser Leu Pro 
470 

Ala Phe Val Thr 
485 

Lys Leu Lys Arg 

Phe He Lys Lys 
520 

Asn Gin Gly Thr 
535 

Thr Asn Ser Gly 
550 

Glu Lys Asn Pro 



Asp Gin Glu Leu 

Ser Leu Asp Ser 
220 

Val Thr Asn Thr 
235 

Thr Glu Lys Arg 
250 

Ser Val Ser Asn 
265 

Ser Ser Leu Gin 

Met Asn Val Glu 
300 

Leu Ala Arg Ser 
315 

Asn Asp Arg Arg 
330 

Asp Pro Leu Cys 
345 

Ser Glu Asn Pro 

Ser Ser He Gin 
380 

Leu Gly Ser Asp 
395 

Val Ala Asp Val 
410 

Ser Ser Glu Lys 
425 

He Cys Lys Ser 

Glu Asp Lys. He 
460 

Asn Leu Ser His 
475 

Glu Pro Gin He 
490 

Lys Arg Arg Pro 
505 

Ala Asp Leu Ala 

Asn Gin Thr Glu 
540 

His Glu Asn Lys 
555 

Asn Pro He Glu 



Leu Gin He Thr 
205 

Ala Lys Lys Ala 

Glu His His Gin 
240 

Ala Ala Glu Arg 
255 

Leu His Val Glu 
270 

His Glu Asn Ser 
285 

Lys Ala Glu Phe 

Gin His Asn Arg 
320 

Thr Pro Ser Thr 
335 

Glu Arg Lys Glu 
350 

Arg Asp Thr Glu 
365 

Lys Val Asn Glu 

Asp Ser His Asp 
400 

Leu Asp Val Leu 
415 

He Asp Leu Leu 
430 

Glu Arg Val His 
445 

Phe Gly Lys Thr 

Val Thr Glu Asn 
480 

He Gin Glu Arg 
495 

Thr Ser Gly Leu 
510 

Val Gin Lys Thr 
525 

Gin Asn Gly Gin 

Thr Lys Gly Asp 
560 

Ser Leu Glu Lys 
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565 570 575 

Glu Ser Ala Phe Ly 3 Thr Lys Ala Glu Pro lie Ser Ser Ser He Ser 

580 585 590 

Asn Met Glu Leu Glu Leu Asn He His Asn Ser Lys Ala Pro Lys Lys 

595 600 605 

Asn Arg Leu Arg Arg Lys Ser Ser Thr Arg His He His Ala Leu Glu 
r ,! ? 615 620 

Leu Val Val Ser Arg Asn Leu Ser Pro Pro Asn Cys Thr Glu Leu Gin 

630 «5 640 

He Asp Ser Cys Ser Ser Ser Glu Glu He Lys Lys Lys Lys Tyr Asn 

645 650 655 

Gin Met Pro Val Arg His Ser Arg Asn Leu Gin Leu Met Glu Gly Lys 

660 6 65 670 

Glu Pro Ala Thr Gly Ala Lys Lys Ser Asn Lys Pro Asn Glu Gin Thr 
675 680 685 

Ser Lys Arg His Asp Ser Asp Thr Phe Pro Glu Leu Lys Leu Thr Asn 
690 695 70Q 

Ala Pro Gly Ser Phe Thr Lys Cys Ser Asn Thr Ser Glu Leu Lys Glu 

710 . 715 

Phe Val Asn Pro Ser Leu Pro Arg Glu Glu Lys Glu Glu Lys Leu Glu 

725 730 735 

Thr Val Lys Val Ser Asn Asn Ala Glu Asp Pro Lys Asp Leu Met Leu 

740 745 750 

Ser Gly Glu Arg Val Leu Gin Thr Glu Arg Ser Val Glu Ser Ser Ser 

755 760 765 

He Ser Leu Val Pro Gly Thr Asp Tyr Gly Thr Gin Glu Ser He Ser 

770 775 780 

Leu Leu Glu Val Ser Thr Leu Gly Lys Ala Lys Thr Glu Pro Asn Lys 
1 790 795 goo 

Cys Val Ser Gin Cys Ala Ala Phe Glu Asn Pro Lys Gly Leu lie His 

805 810 815 

Gly Cys Ser Lys Asp Asn Arg Asn Asp Thr Glu Gly Phe Lys Tyr Pro 

820 825 830 

Leu Gly His Glu Val Asn His Ser Arg Glu Thr Ser He Glu Met Glu 

835 840 * 845 

Glu Ser Glu Leu Asp Ala Gin Tyr Leu Gin Asn Thr Phe Lys Val Ser 

850 855 860 

Lys Arg Gin Ser Phe Ala Leu Phe Ser Asn Pro Gly Asn Ala Glu Glu 

870 875 880 

Glu Cys Ala Thr Phe Ser Ala His Ser Gly Ser Leu Lys Lys Gin Ser 

885 890 895 

Pro Lys Val Thr Phe Glu Cys Glu Gin Lys Glu Glu Asn Gin Gly Lys 

900 905 910 Y y 

Asn Glu Ser Asn He Lys Pro Val Gin Thr Val Asn He Thr Ala Glv 

915 920 925 

Phe Pro Val Val Gly Gin Lys Asp Lys Pro Val Asp Asn Ala Lys Cys 
930 935 940 
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Ser He Lys Gly Gly Se r Arg Phe Cys Leu Ser Ser Gin Phe Arg Gly 

950 955 960 

Asn Glu Thr Gly Leu He Thr Pro Asn Lys His Gly Leu Leu Gin Asn 

970 975 
Pro Tyr Arg lie Pro Pro Leu Phe Pro He Lys Ser Phe Val Lys Thr 

980 985 990 

Lys Cys Lys Lys Asn Leu Leu Glu Glu Asn Phe Glu Glu His Ser Met 

"5 1000 1005 

Ser Pro Glu Arg Glu Met Gly Asn Glu Asn He Pro Ser Thr Val Ser 

1010 1015 102 0 

Thr lie Ser Arg Asn Asn He Arg Glu Asn Val Phe Lys Gly Ala Ser 

c„ 1030 1035 104 

Ser Ser Asn He Asn Glu Val Gly Ser Ser Thr Asn Glu Val Gly Ser 

1045 1050 1055 

Ser He Asn Glu He Gly Ser Ser Asp Glu Asn lie Gin Ala Glu Leu 

1060 1065 1070 

Gly Arg Asn Arg Gly Pro Lys Leu Asn Ala Met Leu Arg Leu Gly Val 
1075 108 O 1085 

on LyS Gln Ser LSU Pr ° G1 V Ser ^n Cys Lys 

1090 1095 1100 

His Pro Glu He Lys Lys Gin Glu Tyr Glu Glu Val Val Gin Thr Val 

I 1110 1H5 112 

Asn Thr Asp Phe Ser^Pro Tyr Leu He Ser Asp Asn Leu Glu Gin Pro 

Met Gly Ser Ser His Ala Ser Gin Val Cys Ser Glu Thr Pro Asp^Asp 

II 40 1145 i 150 

Leu Leu Asp Asp Gly Glu He Lys Glu Asp Thr Ser Phe Ala Glu Asn 

II 55 1160 1165 

Asp lie Lys Glu Ser Ser Ala Val Phe Ser Lys Ser Val Gin Arg Gly 

H70 ins 118Q 

Glu Leu Ser Arg Ser Pro Ser Pro Phe Thr His Thr His Leu Ala Gin 
" H90 U95 120 

Gly Tyr Arg Arg Gly Ala Lys Lys Leu Glu Ser Ser Glu Glu Asn Leu 

1205 12 io i 215 

Ser Ser Glu Asp Glu Glu Leu Pro Cys* Phe Gin His Leu Leu Phe Gly 

1220 1225 1230 

Lys Val Asn Asn He Pro Ser Gin Ser Thr Arg His Ser Thr Val Ala 

1235 1240 1245 

Thr Glu Cy S Leu Ser Lys Asn Thr Glu Glu Asn Leu Leu Ser Leu Lys 

1250 1255 1260 

Asn Ser Leu Asn Asp Cys Ser Asn Gin Val He Leu Ala Lys Ala Ser 
LZbi> 1270 1275 128 

Gin Glu His His Leu Ser Glu Glu Thr Lys Cys Ser Ala Ser Leu Phe 

1285 1290 1295 

Ser Ser Gin Cys Ser Glu Leu Glu Asp Leu Thr Ala Asn Thr Asn Thr 

1300 1305 1310 

Gin Asp Pro Phe Leu He Gly Ser Ser Lys Gin Met Arg His Gin Ser 
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1315 1320 



1325 



Glu Ser Gin Gly Val Gly Leu Ser Asp Lys Glu Leu Val Ser Asp Asp 

1330 133 5 134 0 

Glu Glu Arg Gly Thr Gly Leu Glu Glu Asn Asn Gin Glu Glu Gin Ser 
1345 1350 1355 

Met Asp Ser Asn Leu Gly' Glu Ala Ala Ser Gly Cys Glu Ser Glu ihr 

1365 1370 1375 

Ser Val Ser Glu Asp Cys Ser Gly Leu Ser Ser Gin Ser Asp He Leu 

1380 1385 1390 

Thr Thr Gin Gin Arg Asp Thr Met Gin His Asn Leu He Lys Leu Gin 

1395 1400 1405 

Gin Glu Met Ala Glu Leu Glu Ala Val Leu Glu Gin His Gly Ser Gin 

1410 1415 1420 

Pro Ser Asn Ser Tyr Pro Ser lie He Ser Asp Ser Ser Ala Leu Glu 

1430 1435 
Asp Leu Arg Asn Pro Glu Gin Ser Thr Ser Glu Lys Ala Val Leu Thr 

1445 1450 1455 

Ser Gin Lys Ser Ser Glu Tyr Pro lie Ser Gin Asn Pro Glu Gly Leu 

1460 1465 1470 

Ser Ala Asp Lys Phe Glu Val Ser Ala Asp Ser Ser Thr Ser Lys Asn 
1475 1480 i4 8 5 

?JL Pr ° Gly Vai G1U Arg Ser Ser Pro Ser ^ Pro Ser Leu 

1490 1495 150Q 

Asp Asp Arg Trp Tyr Met His Ser Cys Ser Gly Ser Leu Gin Asn Arg 

-L^Uo 1510 1515 

Asn Tyr Pro Ser Gin Glu Glu Leu He Lys Val Val Asp Val Glu Glu 

1525 1530 1535 

Gin Gin Leu Glu Glu Ser Gly Pro His Asp Leu Thr Glu Thr Ser Tyr 

1540 1545 155Q 

Leu Pro Arg Gin Asp Leu Glu Gly Thr Pro Tyr Leu Glu Ser Gly He 
1555 i5 6 o 1565 

Ser Leu Phe Ser Asp Asp Pro Glu Ser Asp Pro Ser Glu Asp Arg Ala 
1570 1575 1580 

Pro Glu Ser Ala Arg Val Gly Asn He Pro Ser Ser Thr Ser Ala Leu 
1585 1590 ' 1595 iso 

Lys Val Pro Gin Leu Lys Val Ala Glu Ser Ala Gin Gly Pro Ala Ala 

1605 i 6 io 1615 

Ala His Thr Thr Asp Thr Ala Gly Tyr Asn Ala Met Glu Glu Ser Val 

1620 1625 1630 

Ser Arg Glu Lys Pro Glu Leu Thr Ala Ser Thr Glu Arg Val Asn Lys 

I 635 1640 1645 

Arg Met Ser Met Val Val Ser Gly Leu Thr Pro Glu Glu Phe Met Leu 

1650 1655 i 66 o 

Val Tyr Lys Phe Ala Arg Lys His His He Thr Leu Thr Asn Leu He 
1665 1670 i 67 5 168 

Thr Glu Glu Thr Thr His Val Val Met Lys Thr Asp Ala Glu Phe Val 
!685 1690 1695 
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Cys Glu Arg Thr Leu Lys Tyr Phe Leu Gly lie Ala Gly Gly Lys Trp 

1700 1705 1710 

Val Val Ser Tyr Phe Trp Val Thr Gin Ser lie Lys Glu Arg Lys Met 

• 1715 1720 1725 

Leu Asn Glu His Asp Phe Glu Val Arg Gly Asp Val Val Asn Gly Arg 

1730 1735 i 7 40 y 

Asn His Gin Gly Pro Lys Arg Ala Arg Glu Ser Gin Asp Arg Lys lie 

1750 2,755 

Phe Arg Gly Leu Glu_Ile Cys Cys Tyr Gly Pro Phe Thr Asn Met Pro 

Thr Asp Gin Leu Glu Trp Met Val Gin LeTcys Gly Ala Ser Jl^Val 

1780 1785 1790 

Lys Glu Leu Ser Ser Phe Thr Leu Gly Thr Gly Val His Pro He Val 

1795 1800 1805 

Val Val Gin Pro Asp Ala Trp Thr Glu Asp Asn Gly Phe His Ala lie 
P1 1815 1820 

Gly Gin Met Cys Glu Ala Pro Val Val Thr Arg Glu Trp Val Leu Asp 
l 5 1830 1835 18 4 

Ser Val Ala Leu Tyr Gin Cys Gin Glu Leu Asp Thr Tyr Leu He Pro 

1845 1850 1855 

Gin He Pro His Ser His Tyr 

1860 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE.SS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
AATCTTAGAG TGTCCCA 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
ATCTTAGTGT CCCACCT 
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(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
CAGAAAAAAA GGTAGAT 

17 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION; SEQ ID NO : 6 : 
CAGAAAAAAA AGGTAGA 

17 

(2) INFORMATION FOR SEQ ID NO : 7 ; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 7 : 
AGAGAATCCC AG G AC AG 

17 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

CC) STRANDEDNESS: single 

CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 8: 
AGAGAATCCC CAGGACA 

17 
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(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
CB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 9 : 
AGGACCTGCG AAATCCA 

17 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AGGACCTGTG AAATCCA 

17 



283 



